Corpus: heb_news_2008_30K


2.2.5 Most frequent word beginnings

The most frequent word beginnings, given as character N-grams for N = 1 to 5, together with a Zipf diagram.
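
As a rough illustration of how such a ranking could be produced, the sketch below counts the first n characters of each word and returns the most frequent ones. The function name `top_word_beginnings` and the toy word list are hypothetical, and the page does not state whether it counts running tokens or distinct word forms, nor how it treats words shorter than n. This sketch counts every item of the list it is given and skips words shorter than n.

```python
from collections import Counter

def top_word_beginnings(words, n, k=5):
    """Return the k most frequent n-character word beginnings as (prefix, count) pairs.

    Assumption: words shorter than n characters are skipped.
    """
    counts = Counter(w[:n] for w in words if len(w) >= n)
    return counts.most_common(k)

# Hypothetical toy word list, not taken from the corpus.
words = ["the", "then", "this", "that", "tree", "apple", "an"]
for n in range(1, 4):
    print(n, top_word_beginnings(words, n))
```

Passing a list of distinct word forms instead of running text would give type-based counts with the same function.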


Figure: Zipf's diagram for word beginnings (Gnuplot diagram)
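
A Zipf diagram plots frequency against rank on logarithmic axes. The following is a minimal sketch of how the plotted points and a least-squares slope could be computed from a list of frequencies. The helper names `zipf_points` and `loglog_slope` are hypothetical, and nothing here is taken from the tool that produced the original diagram.

```python
import math

def zipf_points(frequencies):
    """Rank the frequencies in descending order and return (log10 rank, log10 frequency) pairs."""
    ordered = sorted(frequencies, reverse=True)
    return [(math.log10(rank), math.log10(freq))
            for rank, freq in enumerate(ordered, start=1) if freq > 0]

def loglog_slope(points):
    """Ordinary least-squares slope of log frequency against log rank (needs at least two points)."""
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in points)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points)
    return sxy / sxx

# The five single-character frequencies listed on this page; five points are
# far too few for a meaningful fit and serve only to show the call.
single_char_freqs = [14153, 10185, 9101, 8258, 8032]
print(loglog_slope(zipf_points(single_char_freqs)))
```

A slope close to -1 corresponds to the classic Zipf relationship, in which frequency is inversely proportional to rank.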

Top Characters
rank frequency n-gram
1 14153 ה-
2 10185 ו-
3 9101 מ-
4 8258 ב-
5 8032 ש-
Top Character Bigrams
rank frequency n-gram
1 2497 המ-
2 1970 וה-
3 1551 שה-
4 1329 ומ-
5 1298 מה-
Top Character Trigrams
rank frequency n-gram
1 373 המו-
2 256 להת-
3 247 והמ-
4 231 המת-
5 231 המש-
Top Character 4-Grams
rank frequency n-gram
1 59 המשו-
2 50 האינ-
3 48 ב-19-
4 45 אינט-
5 41 והמו-
Top Character 5-Grams
rank frequency n-gram
1 28 פירסו-
2 23 האינט-
3 19 ב-200-
4 18 האירו-
5 18 אינטר-
Computed in 735 ms on 2018-03-07 at 08:28.